Search for: All records

Creators/Authors contains: "Achille, Alessandro"

Note: Clicking a Digital Object Identifier (DOI) link takes you to an external site maintained by the publisher. Some full-text articles may not be available free of charge during the embargo (an administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from those of this site.

  1. We propose a notion of common information that allows one to quantify and separate the information that is shared between two random variables from the information that is unique to each. Our notion of common information is defined by an optimization problem over a family of functions and recovers the Gács-Körner common information as a special case. Importantly, our notion can be approximated empirically using samples from the underlying data distribution. We then provide a method to partition and quantify the common and unique information using a simple modification of a traditional variational auto-encoder. Empirically, we demonstrate that our formulation allows us to learn semantically meaningful common and unique factors of variation even on high-dimensional data such as images and videos. Moreover, on datasets where ground-truth latent factors are known, we show that we can accurately quantify the common information between the random variables. (See the first sketch after this list.)
  2. We propose a novel deterministic method for preparing arbitrary quantum states. When our protocol is compiled into CNOT and arbitrary single-qubit gates, it prepares an N-dimensional state in depth O(log(N)) and spacetime allocation (a metric that accounts for the fact that oftentimes some ancilla qubits need not be active for the entire circuit) O(N), which are both optimal. When compiled into the {H, S, T, CNOT} gate set, we show that it requires asymptotically fewer quantum resources than previous methods. Specifically, it prepares an arbitrary state up to error ε with optimal depth of O(log(N) + log(1/ε)) and spacetime allocation O(N log(log(N)/ε)), improving over O(log(N) log(log(N)/ε)) and O(N log(N/ε)), respectively. We illustrate how the reduced spacetime allocation of our protocol enables rapid preparation of many disjoint states with only constant-factor ancilla overhead: O(N) ancilla qubits are reused efficiently to prepare a product state of w N-dimensional states in depth O(w + log(N)) rather than O(w log(N)), achieving effectively constant depth per state. We highlight several applications where this ability would be useful, including quantum machine learning, Hamiltonian simulation, and solving linear systems of equations. We provide quantum circuit descriptions of our protocol, detailed pseudocode, and gate-level implementation examples using Braket. (See the second sketch after this list.)
  3. We introduce the Redundant Information Neural Estimator (RINE), a method that allows efficient estimation of the component of information about a target variable that is common to a set of sources, known as the “redundant information”. We show that existing definitions of the redundant information can be recast as an optimization over a family of functions. In contrast to previous information decompositions, which can only be evaluated for discrete variables over small alphabets, we show that optimizing over functions enables the approximation of the redundant information for high-dimensional and continuous predictors. We demonstrate this on high-dimensional image classification and motor-neuroscience tasks. (See the third sketch after this list.)
  4. We introduce a notion of usable information contained in the representation learned by a deep network, and use it to study how optimal representations for the task emerge during training. We show that the implicit regularization coming from training with Stochastic Gradient Descent (SGD) with a high learning rate and a small batch size plays an important role in learning minimal sufficient representations for the task. In the process of arriving at a minimal sufficient representation, we find that the content of the representation changes dynamically during training. In particular, we find that semantically meaningful but ultimately irrelevant information is encoded in the early transient dynamics of training, before being discarded later. In addition, we evaluate how perturbing the initial part of training impacts the learning dynamics and the resulting representations. We show these effects both on perceptual decision-making tasks inspired by the neuroscience literature and on standard image classification tasks. (See the fourth sketch after this list.)
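
First sketch (entry 1): the snippet below is a minimal, illustrative PyTorch sketch of one way to split a variational auto-encoder's latent code into a block intended to be common to a pair of observations and blocks unique to each. The module names, the swap-decoding trick, and the agreement penalty are illustrative assumptions, not the paper's construction.

```python
# Illustrative sketch only: a paired VAE whose latent code is split into a
# block meant to be common to (x1, x2) and blocks unique to each input.
import torch
import torch.nn as nn

class PairedVAE(nn.Module):
    def __init__(self, x_dim=784, z_common=8, z_unique=8, hidden=256):
        super().__init__()
        out = 2 * (z_common + z_unique)  # mean and log-variance per latent
        self.enc1 = nn.Sequential(nn.Linear(x_dim, hidden), nn.ReLU(), nn.Linear(hidden, out))
        self.enc2 = nn.Sequential(nn.Linear(x_dim, hidden), nn.ReLU(), nn.Linear(hidden, out))
        self.dec1 = nn.Sequential(nn.Linear(z_common + z_unique, hidden), nn.ReLU(), nn.Linear(hidden, x_dim))
        self.dec2 = nn.Sequential(nn.Linear(z_common + z_unique, hidden), nn.ReLU(), nn.Linear(hidden, x_dim))
        self.z_common = z_common

    @staticmethod
    def reparam(stats):
        mu, logvar = stats.chunk(2, dim=-1)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        kl = 0.5 * (mu.pow(2) + logvar.exp() - logvar - 1).sum(-1)
        return z, kl

    def forward(self, x1, x2):
        z1, kl1 = self.reparam(self.enc1(x1))
        z2, kl2 = self.reparam(self.enc2(x2))
        c1, u1 = z1[:, :self.z_common], z1[:, self.z_common:]
        c2, u2 = z2[:, :self.z_common], z2[:, self.z_common:]
        # Reconstruct each view from its own unique code plus the *other*
        # view's common code, pressuring the common block to carry only
        # information present in both inputs.
        r1 = self.dec1(torch.cat([c2, u1], dim=-1))
        r2 = self.dec2(torch.cat([c1, u2], dim=-1))
        rec = ((r1 - x1) ** 2).sum(-1) + ((r2 - x2) ** 2).sum(-1)
        agree = ((c1 - c2) ** 2).sum(-1)  # encourage the common codes to match
        return (rec + kl1 + kl2 + agree).mean()

# usage: loss = PairedVAE()(x1_batch, x2_batch); loss.backward()
```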
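Second sketch (entry 2): the proposed protocol cannot be reconstructed from the abstract alone, so the snippet below only computes the rotation angles for the textbook binary-tree amplitude-encoding baseline (whose gate count grows linearly with N). It is shown for orientation, not as the paper's method; the function name is hypothetical and the amplitudes are assumed real and non-negative.

```python
# Textbook binary-tree amplitude-encoding angles (baseline, not the paper's
# protocol): split probability mass between left and right subtrees at each
# level; each angle drives one RY rotation.
import numpy as np

def ry_angles(amplitudes):
    """Rotation angles for a real, non-negative vector whose length is a power
    of two; angles[level] holds the 2**level angles of that tree level."""
    a = np.asarray(amplitudes, dtype=float)
    a = a / np.linalg.norm(a)
    probs = a ** 2
    n = int(np.log2(len(a)))
    angles = []
    for level in range(n):
        block = len(a) // (2 ** (level + 1))  # leaves under each child node
        level_angles = []
        for node in range(2 ** level):
            left = probs[2 * node * block:(2 * node + 1) * block].sum()
            total = probs[2 * node * block:(2 * node + 2) * block].sum()
            ratio = np.clip(left / total, 0.0, 1.0) if total > 0 else 1.0
            # RY(theta)|0> = cos(theta/2)|0> + sin(theta/2)|1>
            level_angles.append(2 * np.arccos(np.sqrt(ratio)))
        angles.append(level_angles)
    return angles

print(ry_angles([1, 1, 1, 1]))  # uniform 2-qubit state -> every angle is pi/2
```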
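Third sketch (entry 3): this is not RINE, whose exact functional definition is not given in the abstract. It only shows the simplest probe-based stand-in for "optimizing over a family of functions": estimate, for each source separately, how much information a trained predictor extracts about the target (via its cross-entropy), then take the minimum across sources as a crude redundancy proxy. All names and the toy data are hypothetical.

```python
# Crude redundancy proxy (not RINE): per-source predictive information from a
# trained linear probe, reduced with a minimum over sources.
import torch
import torch.nn as nn

def probe_information(x, y, n_classes, epochs=200, lr=1e-2):
    """Estimate H(Y) - H(Y | f(X)) in nats, where f is a trained linear probe."""
    probe = nn.Linear(x.shape[1], n_classes)
    opt = torch.optim.Adam(probe.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss_fn(probe(x), y).backward()
        opt.step()
    with torch.no_grad():
        cond_ent = loss_fn(probe(x), y).item()           # approx. H(Y | f(X))
    counts = torch.bincount(y, minlength=n_classes).float()
    p = counts / counts.sum()
    marg_ent = -(p[p > 0] * p[p > 0].log()).sum().item()  # H(Y)
    return max(marg_ent - cond_ent, 0.0)

def min_redundancy(sources, y, n_classes):
    """Smallest per-source predictive information across the sources."""
    return min(probe_information(x, y, n_classes) for x in sources)

# usage (toy data): two noisy views of the same binary label
y = torch.randint(0, 2, (512,))
x1 = y.float().unsqueeze(1) + 0.5 * torch.randn(512, 1)
x2 = y.float().unsqueeze(1) + 0.5 * torch.randn(512, 1)
print(min_redundancy([x1, x2], y, n_classes=2))
```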
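Fourth sketch (entry 4): one common way to operationalize the "usable information" in a representation is to freeze the network at successive training checkpoints, attach a simple decoder (here a linear probe) to an intermediate layer, and measure how well some attribute can be read out; tracking this across checkpoints reveals what is encoded early and later discarded. The loop below is a generic probing sketch with hypothetical names (checkpoints, feature_fn), not the authors' experimental code.

```python
# Generic probing loop: measure how decodable an attribute is from a frozen
# intermediate representation at successive training checkpoints.
import torch
import torch.nn as nn

def probe_accuracy(features, labels, n_classes, epochs=300, lr=1e-2):
    """Fit a linear probe on frozen features and return its accuracy."""
    probe = nn.Linear(features.shape[1], n_classes)
    opt = torch.optim.Adam(probe.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss_fn(probe(features), labels).backward()
        opt.step()
    with torch.no_grad():
        pred = probe(features).argmax(dim=1)
    return (pred == labels).float().mean().item()

def trace_usable_information(checkpoints, feature_fn, x, attribute, n_classes):
    """For each saved checkpoint, probe how well `attribute` can be read out
    of the frozen representation feature_fn(model, x)."""
    trace = []
    for step, model in checkpoints:        # e.g. [(100, model_at_100), ...]
        model.eval()
        with torch.no_grad():
            feats = feature_fn(model, x)   # frozen intermediate activations
        trace.append((step, probe_accuracy(feats, attribute, n_classes)))
    return trace

# usage: pass a list of (step, model) snapshots saved during training and a
# feature_fn that returns the layer activations you want to probe.
```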